Published online 14 May 2014 



Nucleic Acids Research, 2014, Vol 42, No. 11 7383-7394 

doi: 10.1093lnarlgku338 



Mechanical properties of symmetric and asymmetric 
DNA A-tracts: implications for looping and 
nucleosome positioning 

Tomas Drsata 12 , Nada Spackova 2 , Petr Jurecka 3 , Marie Zgarbova 3 , Jirf Sponer 2 4 and 
Filip Lankas 1 '* 

1 1nstitute of Organic Chemistry and Biochemistry, Academy of Sciences of the Czech Republic, Flemingovo nam. 2, 
16610 Prague, Czech Republic, institute of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska 
135, 61265 Brno, Czech Republic, 3 Regional Centre of Advanced Technologies and Materials, Department of 
Physical Chemistry, Faculty of Science, Palacky University, 17. listopadu 12, 77146 Olomouc, Czech Republic and 
4 CEITEC-Central European Institute of Technology, Campus Bohunice, Kamenice 5, 62500 Brno, Czech Republic 



Received January 31, 2014; Revised April 07, 2014; Accepted April 9, 2014 



ABSTRACT 

A-tracts are functionally important DNA sequences 
which induce helix bending and have peculiar 
structural properties. While A-tract structure has 
been qualitatively well characterized, their mechani- 
cal properties remain controversial. A-tracts appear 
structurally rigid and resist nucleosome formation, 
but seem flexible in DNA looping. In this work, we in- 
vestigate mechanical properties of symmetric A n T n 
and asymmetric A 2n tracts for n = 3, 4, 5 using 
two types of coarse-grained models. The first model 
represents DNA as an ensemble of interacting rigid 
bases with non-local quadratic deformation energy, 
the second one treats DNA as an anisotropically 
bendable and twistable elastic rod. Parameters for 
both models are inferred from microsecond long, 
atomic-resolution molecular dynamics simulations. 
We find that asymmetric A-tracts are more rigid 
than the control G/C-rich sequence in localized dis- 
tortions relevant for nucleosome formation, but are 
more flexible in global bending and twisting relevant 
for looping. The symmetric tracts, in contrast, are 
more rigid than asymmetric tracts and the control, 
both locally and globally. Our results can reconcile 
the contradictory stiffness data on A-tracts and sug- 
gest symmetric A-tracts to be more efficient in nu- 
cleosome exclusion than the asymmetric ones. This 
would open a new possibility of gene expression ma- 
nipulation using A-tracts. 



INTRODUCTION 

A-tracts are commonly defined as DNA sequences of at 
least four consecutive A-T base pairs without an intervening 
TA step. When embedded in a general sequence, they bend 
the DNA double helix towards the minor groove in the cen- 
tre of the A-tract. The bending magnitude depends on the 
A-tract length and sequence as well as on temperature and 
ionic composition of the buffer. A-tracts exhibit a particu- 
lar structure characterized by large negative propeller and a 
minor groove progressively narrowing in the 5' to 3' direc- 
tion of the adenine strand. They also have a spine of hydra- 
tion in the minor groove. This unique structure is formed 
cooperatively when passing from three to four A-T pairs in 
a row. At elevated temperatures, A-tracts undergo a coop- 
erative pre-melting transition in which their unique confor- 
mation is transformed into a more B-DNA-like structure. 
Properties of A-tracts and their prominent importance in 
DNA biology and biophysics have been reviewed multiple 
times (1-6). In particular, A-tracts affect nucleosome posi- 
tioning which, in turn, is critical for regulating gene expres- 
sion (7,8). By manipulating A-tracts, gene expression can be 
tuned in a predictable manner (9,10). 

While structural features of A-tracts seem to be qualita- 
tively well established, their mechanical properties are not 
fully understood. Crystallographic studies suggest that A- 
tracts are conformationally rigid. Although the crystallized 
oligomers are bent in very different directions dictated by 
crystal packing, the A-tracts themselves are almost straight 
and have similar conformations (4,11,12). Early molecular 
dynamics (MD) simulations also indicated a rigid A-tract 
(13-15). More recent simulation studies report that fluc- 
tuations of twist, rise and minor groove width in the A- 
tract are always smaller than in the flanking sequences and 
do not change with imposed external bending (16,17). Nu- 
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clear magnetic resonance (NMR) carbon spin relaxation 
combined with MD simulations suggest exceptionally small 
sugar flexibility within A-tracts (18). Another recent MD 
work (19) has found that water bridge occupancies in the 
spine of hydration correlate with DNA stiffness assessed us- 
ing the standard local dinucleotide model (20-23). Consis- 
tent with the view of a ridig A-tract, it has been shown that 
TATA boxes containing 3-4 consecutive adenines are best 
described as context-independent structures conforming to 
a nearest-neighbour non-additive model of protein binding 
(24). Imino proton resonance measurements show that A-T 
base pairs within A-tracts have lifetimes at least an order of 
magnitude longer than in other B-DNA duplexes (25). 

However, A-tracts may not be exceptionally rigid in all 
circumstances. Cyclization kinetics data suggest that the A4 
sequence is not more rigid, both in bending and in twist- 
ing, than many other tetranucleotide sequences (26). An- 
other cyclization study has found bending rigidity of a se- 
quence with phased A 6 tracts to be similar to that of generic 
DNA (27). In fluorescence-based, protein-free cyclization 
experiments (28), inserting A n tracts in the middle of a ran- 
dom non-A-tract sequence resulted in prolongation of loop- 
ing times, but for some tracts (n = 10, 17), these times 
were shorter than looping times of another non-A-tract 
sequence. A very recent work reports that A n tracts ap- 
pear highly flexible in the context of transcription factor- 
mediated DNA looping (29). 

Another open question concerns properties of the sym- 
metric A n T n tracts compared to asymmetric, homopoly- 
meric A2 n tracts of the same length. It was discovered 
early on that phased A3T3 and A4T4 tracts migrate anoma- 
lously slowly in gel electrophoresis (30,31). Gel mobilities 
of phased A4T4 were found to be similar to the properties 
of A6 (32). A later study reported a slightly lower bending 
induced by A3T3 compared to A6 in the same sequence con- 
text, but the difference was judged to be only marginally 
significant (33). Another work also found phased A4T4 to 
migrate anomalously slowly, albeit faster than A6 (34). Just 
like A2n tracts, A n T n have exceptionally long base pair life- 
times (25). Crystal structures (35,36), NMR data (34) and 
molecular modelling (6,13,37,38) also suggest that symmet- 
ric and asymmetric tracts share similar basic characteris- 
tics. On the other hand, the two A-tract types can hardly be 
entirely identical, due to the palindromic sequence and the 
presence of an AT step in the symmetric tract. Indeed, it has 
been proposed that the structural basis of bending induced 
by symmetric and asymmetric tracts may differ (35). 

In this work we set out to investigate structure and me- 
chanical properties of A-tracts using molecular modelling. 
We focus on A3T3, A4T4 and A5T5 symmetric tracts and 
their asymmetric counterparts A6, Ag and A10, all embed- 
ded in a non-A-tract sequence. To describe their properties, 
we use two types of coarse-grained DNA models (Figure 1). 

The first model represents DNA as an ensemble of in- 
teracting rigid bases with an underlying quadratic, non- 
local deformation energy function (22,23,39-41). Every 
base can in principle be coupled to any other base in the 
oligomer, yielding a full description of base-base interac- 
tions in the harmonic approximation. This substantially ex- 
tends the standard local models in which individual dinu- 
cleotide steps (20,21) or individual base pairs (42,43) are 




Figure 1. The multiscale modelling approach used in this work. Structures 
from all-atom MD (left) are used to infer parameters of a non-local rigid 
base model. Averaging base-fixed frames of selected bases yields mean 
frames which, in turn, define a description of the DNA oligomer as a piece 
of anisotropically bendable and twistable elastic rod (right). The rigid base 
diagrams were produced with 3DNA. 

characterized by 6D quadratic deformation energies. The 
local models have already been successful in problems, such 
as sequence-dependent nucleosome positioning, identifying 
transcription factor binding sites and promoter location, 
or unravelling basic mechanical features of DNA (44-57). 
Nevertheless, the local models necessarily miss conforma- 
tional couplings between intra-basepair and inter-basepair 
coordinates, some of which are well documented (58). They 
also neglect all couplings between different base pairs or 
steps. These long-range interactions are expected to be par- 
ticularly important in A-tracts with their delocalized, inter- 
locked structure, and are properly accounted for by the non- 
local description used here. 

The second model we employ in this work represents a 
whole DNA fragment as an elastic rod with anisotropic 
bending and twisting rigidities. It is close in spirit to the de- 
scription already proposed some years ago (59). Parameters 
for both the rigid base and the elastic rod models are con- 
sistently deduced from extensive, state-of-the-art atomic- 
resolution MD simulations with explicit inclusion of water 
and ions. We compare detailed shape and stiffness prop- 
erties of the A-tracts, and discuss their implications. By 
threading the A-tracts through a nucleosome structure, we 
demonstrate the difference between symmetric and asym- 
metric A-tracts with respect to nucleosome positioning. Our 
results should help to better understand the behaviour of 
A-tracts in DNA looping and nucleosome positioning. In 
particular, our data suggest that using A n T n tracts instead 
of A2n tracts may open yet another possibility to modulate 
nucleosome positioning and thus affect gene regulation. 

MATERIALS AND METHODS 

Model of interacting rigid bases 

In this model we describe a DNA oligomer as an ensem- 
ble of interacting rigid bodies representing individual bases 
(22,23,39^1). The relative orientation and displacement of 
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the bases is defined by intra-basepair coordinates buckle, 
propeller, opening, shear, stretch and stagger, and inter- 
basepair or step coordinates tilt, roll, twist, shift, slide and 
rise (58,60). For a DNA molecule of n base pairs, there are 
N = 12^ — 6 coordinates which we write as components of 
a vector w. The base-base interactions are characterized by 
a deformation energy of general quadratic form 

E(\\ ) = ^ (w — w) • K (w — w) = 



to w#, 



*j) 



(1) 



z,7=l 



where w is the vector of coordinates defining the energy 
minimum (shape parameters) and K is the stiffness matrix. 
Thus, E(w) is the energy required to distort the coordi- 
nates w away from their equilibrium values w. The model 
is considered to be in contact with a thermal bath of tem- 
perature T. Assuming small fluctuations of the coordinates, 
their probability distribution can be well approximated by 
an TV-dimensional Gaussian. Consequently, the model pa- 
rameters w and K obey the relations 



w=(w), K = k B TC~ 



(2) 



where (w) is the vector of coordinate means, C is the covari- 
ance matrix of the coordinates, k B is the Boltzmann con- 
stant and the superscript — 1 denotes the matrix inverse. 

The stiffness matrix contains all the information about 
the oligomer deformability (or flexibility) described by the 
model. Besides that, it is also desirable to characterize the 
flexibility in an overall manner by one simple parameter. To 
this end we propose to use the conformational entropy (61), 
defined by the Gibbs formula 



k B I p(w)\np(w)dw 



(3) 



Substituting the Af-dimensional Gaussian distribution for 
p (w) in Equation (3), we find 



S c = -k B ln[(2yte) N dstC] 



(4) 



where e = 2.1 IS ... is the base of the natural logarithm 
and det denotes the matrix determinant. The entropy S c , 
however, is an extensive quantity, meaning that it will in- 
crease with the number of rigid base coordinates (i.e. with 
the oligomer length). It is more convenient to work with en- 
tropy per coordinate, given by 



= S C /N 



(5) 



which is intensive and, therefore, enables one to compare 
oligomers of different lengths. 

The deformation may be specified by prescribing only 
some of the coordinates, while the others are unconstrained 
and free to relax to their energetically optimal values. For 
instance, base pair step coordinates only, or even a subset 
thereof, are often considered. To describe this situation, let 
us write the coordinate vector w as w = (w^, w 5 ), where 
only are prescribed and w 5 are left unconstrained. The 
energy associated with deforming the coordinates is ob- 
tained by minimizing the deformation energy with respect 



^(w^) = min E(w). 



Performing the minimization, we deduce that 
E(w A ) = l - (w., - w A ) • K (w^ - w A ) 



where 



w^ = <w^), K = k B TC~ 



(6) 

(7) 
(8) 



and C is the covariance matrix of the coordinates w^. Thus, 
Equations (7) and (8) of this partially relaxed model are ex- 
actly analogous to Equations (1) and (2), except that only a 
subset of the coordinates w, namely w A , is now involved. 

In particular, if contains only one coordinate, say twist 
co a of the base pair step a, then Equation (8b) reduces to 



K a = 



k B T 



(oO a ~ {(Da)) 2 } 



(9) 



and gives the force constant K a associated with imposing an 
excess twist to step a, with all the other coordinates in the 
oligomer unconstrained. The quantity in the denominator 
is the variance (or square SD) of co a . 

Finally, let contain the six base pair step coordinates 
of step a. Equations (7) and (8) are then just the usual rela- 
tions for the local dinucleotide model. The meaning of the 
6x6 dinucleotide stiffness matrix, reported by various au- 
thors, is now clear: it is the stiffness matrix associated with 
a deformation where only the coordinates of that particular 
base pair step are prescribed, while all the other coordinates 
in the oligomer remain unconstrained. 

Elastic rod model 

In this coarser description, the conformation of the whole 
oligomer is characterized by just three coordinate frames, 
two of them located at its ends and one in the middle (Fig- 
ure 1). The x-axis of the middle frame points to the major 
groove in the centre of the oligomer. Contrary to an earlier 
similar approach (59), we do not rely on the optimal curvi- 
linear helical axis, since it may introduce artefacts to elas- 
tic properties (62). Instead, we obtain the coordinate frames 
by averaging base-fixed standard reference frames at the ap- 
propriate locations (Figure 1). The magnitude # of bending 
is the angle between the z-axes of the two end frames, the 
bending direction cp is measured with respect to the major 
groove in the oligomer centre, with cp = 0 for bending to- 
wards the major groove and cp = jt (or 180°) for bending 
towards the minor groove. The procedure was described in 
detail earlier (6). Besides # and (p, it will be convenient here 
to introduce the quantities p and r (which we call global roll 
and global tilt, respectively) by the relations 



p = # cos cp, x = # sin cp 



(10) 



To complement the structural description, we introduce 
the total twist co, computed as the sum of local dinucleotide 
twists between the end frames (half the values are taken 
at end steps). Assuming a quadratic deformation energy 
and small fluctuations of the coordinates p, r and co, we 
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can adapt the formalism of Equations (1) and (2). Stiff- 
ness constants per unit length have to be introduced to ob- 
tain length-independent material properties. It is also con- 
venient to express the stiffness constants in units of length, 
in analogy with persistence length of semi-flexible polymers. 
The deformation energy then takes the form 

1 T 1 

^(u)=^-(u-u)-K r (u-u) (11) 

where u = (p, r, co) is the vector of coordinates (in radians), 
u is the coordinate vector specifying the energy minimum, 
K r the 3x3 stiffness matrix and /o the equilibrium length 
of the oligomer. The length / 0 is computed as the sum of 
local rises (half the values are taken at end steps), averaged 
over the MD trajectory. The model parameters u and K r are 
inferred from the fluctuations of u using the relations exactly 
analogous to Equation (2). 

Sequence design, MD simulations and model parametrization 

Seven 18-base pair (bp) DNA sequences have been inves- 
tigated. The control C/G-rich sequence comes from the 
chicken (3 A -globin promoter and was used earlier in nucle- 
osome positioning studies (63,64). We then modified the 
control sequence to include in its centre the A3T3, A4T4 
and A5T5 symmetric tracts, and their asymmetric counter- 
parts A6, Ag and Aiq. All the tracts are consistently em- 
bedded in the 5 / -G and C-3' sequence context. The list of 
sequences is in Table 1 . The oligomers were built as canoni- 
cal B-DNA and solvated in an octahedral periodic box with 
extended simple point charge (SPC/E) water, leaving 10 A 
between the box walls and the closest DNA atom. In the 
next step, 34 K + ions were added to neutralize the nega- 
tive DNA charge, additional 29 K + and 29 Cl~ ions were 
included to mimic the physiological concentration of 150 
mM KC1. The ions were parametrized according to Dang 
(65). Each system contains ca. 10 500 water molecules and 
33 000 atoms in total. After equilibration, production runs 
of unrestrained MD at 300 K and 1 atm were performed us- 
ing the parmbscO AMBER force field (66). The periodic box 
volume fluctuates around 334 x 10 3 A 3 , the density around 
1 g.cm -3 . The volume occupied by the DNA is about 18 x 
10 3 A 3 , which represents roughly 5% of the box volume. The 
pmemd module of the AMBER package was used to carry 
out the simulations. The protocol is very similar to that used 
by the Ascona B-DNA consortium (67) and is described in 
detail in Supplementary Methods. The trajectories were ex- 
tended to 1 microsecond each. Snapshots were recorded ev- 
ery 10 ps and analysed with 3DNA (68) to obtain time series 
of base-fixed frames, intra-basepair and step coordinates, 
backbone torsions, minor groove widths, etc. The data then 
underwent filtering to exclude snapshots with broken intra- 
basepair hydrogen bonds (cutoff distance 4 A), as proposed 
earlier (6,39,47). We tested the effect of filtration and found 
that its absence would make the stiffness constants less well 
defined. End base pairs often broke, were not filtered and 
were ignored in all data processing. The model parameters 
were computed from Equation (2) where the ensemble av- 
erages were replaced by averages over the simulated trajec- 
tories. Figure 2 shows the initial canonical B-DNA struc- 




Figure 2. Example of the simulated system. For each simulation, an 18- 
bp oligomer in canonical B-DNA conformation (left) was chosen as the 
starting structure. It was solvated by water molecules in an octahedral pe- 
riodic box, with K + and CI - ions added to neutralize the DNA charge and 
mimic the physiological salt concentration of 150 mM KC1 (middle). The 
final structure of the A3T3 oligomer after one microsecond of MD (right) 
clearly shows bending towards the minor groove in its centre (marked by 
the arrow). 

ture, the simulated system with water and ions, and the fi- 
nal structure of the A3T3 oligomer after 1 microsecond of 
simulation. 

To analyse the stiffness matrix as a whole (e.g. to compute 
the entropy), its entries have to be made dimensionally uni- 
form. To this end we nqn-dimensionalize the coordinates 
using the length scale 1 A and angle scale equal to 360/34, 
or 10.6°, as conventionally done (23,59,69). A simple com- 
putation shows that the entropy S c (Equation (4)) changes 
with coordinate rescaling, but differences of S c do not. If 
one uses just one scaling factor for all the distances and one 
for all the angles (as done here), then the differences in en- 
tropy per coordinate, s c (Equation (5)), also do not change 
with coordinate rescaling. 

RESULTS AND DISCUSSION 

Stiffness at the level of rigid bases 

It has been well documented that A n tract structure is not 
uniform, as minor groove gradually narrows in the 5' to 
y direction of the adenine strand (4,70). The anomalously 
long base pair lifetimes emerge from the second 5' end A- 
tract base pair and further increase for the third pair (25). 
At the y end, the values again return to the usual B-DNA 
properties. Chemical shift experiments indicate a zone of 
transitional structure about 4 bp on the 5' side and 2 bp 
on the y side, with possible slight extension beyond the 3' 
end into the non-A-tract sequence (71). A natural question 
then arises as to whether such gradual buildup of A n tract 
characteristics applies to stiffness as well. 

Figure 3 shows diagonal entries of the rigid base stiffness 
matrix for roll, twist and slide deformations, data for pro- 
peller are in Supplementary Figure SI. The diagonal en- 
tries are stiffness constants of a deformation in which the 
given coordinate is distorted while all the others remain 
fixed (Equation (1)). For the asymmetric A n tracts, we in- 
deed observe a continuous buildup of stiffness in the 5' to 3' 
direction, a plateau in the middle and return to values of the 
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Table 1. Sequences and their properties in the simulations. The underlined pairs include bases defining average frames at the ends and in the middle of the 
studied sequence (see Materials and Methods and Figure 1). Bending direction of 180° points towards the minor groove in the oligomer centre, 0° indicates 
bending into the major groove. The errors in parentheses are mean differences between values for the whole trajectory and for its halves. 
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Figure 3. Diagonal stiffness constants of the non-local rigid base model. Stiffness of the asymmetric, homopolymeric A2 n tracts (red) builds up gradually 
in the 5 f to 3 f direction of the adenine strand and eventually reaches a plateau. In contrast, the symmetric A n T n tracts (green) have a peak at the central 
AT step which extends over the whole tract, with no plateau. Moreover, the peak value increases with the tract length. Both A-tract types exhibit higher 
values than the control sequence (blue). Error bars in this and the other figures are mean differences between values for the whole trajectory and for its 
halves. This data exemplify the stiffness profiles of A-tracts, complementing the well documented structural profiles. They expose sharp differences between 
stiffness distribution in symmetric and asymmetric A-tracts. 



control sequence at the 3' end. The transitional regions span 
2-3 bp at each end and extend slightly beyond the 3' end of 
the tract. The observed extension of the A n tract stiffness 
beyond its 3' end (but not its 5' end) complements the earlier 
analogous finding about the A n tract structure: A n tracts af- 
fect the y flanking sequence, while the 5' sequence remains 
unaffected (72). Diagonal stiffness constants for the remain- 
ing coordinates (not shown) exhibit the same trend, with the 
exception of opening and stretch stiffness which are consis- 



tently lower and uniform within the A-tract, reflecting just 
the smaller number of hydrogen bonds in A-T pairs. 

Stiffness constants associated with another mode of de- 
formation, namely, the one where a given coordinate is dis- 
torted but all the others are free to relax (Equation (9)) fol- 
low the same pattern, only the values are now smaller, since 
the structure is less constrained (Supplementary Figures S 1 
and S2). The same trend is also observed for the diagonal 
stiffness constants of the standard local model where the 
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values are between the former two (not shown). Thus, ho- 
mopolymeric A-tract stiffness follows similar patterns as its 
structure, reflecting once again a continuous buildup of A n 
tract properties in its 5' to 3' direction. 

Stiffness profiles of the symmetric A n T n tracts are in 
striking contrast with the A2 n values (Figures 3, Supplemen- 
tary Figures SI and S2). The first expected difference stems 
from the fact that, in contrast to A2 n , the A n T n sequence 
is palindromic. Therefore (apart from the influence of the 
flanking sequeces), the stiffness profiles within the A n T n 
tract should be symmetric with respect to its centre (39), 
which is indeed almost exactly the case. Surprisingly, how- 
ever, we observe no discernible stiffness plateau within the 
A n T n tracts, not even for A 5 T 5 . The central peak keeps ris- 
ing with the tract length, suggesting that the limiting value 
has not yet been reached. This indicates a long-range influ- 
ence of the central AT step. 

The AT step is known to have remarkable properties: it 
has a sharply defined conformation in one sequence con- 
text but varies considerably between contexts (73), and it 
ranks among the stiffest dinucleotides as judged from the lo- 
cal dinucleotide model (20,21,44,47). This all distinguishes 
AT from the very flexible TA step (20,21,44,47,73). It has 
been found earlier that a TA step has only a local effect on 
surrounding A n tracts, thus T n A n are best understood as 
two adjacent A n blocks with opposite polarity (31). But can 
A n T n also be described just as two consecutive A n tracts? 
Our stiffness data indicate that they cannot, at least not for 
n < 5. Obviously, the effect of the central AT step should 
ultimately vanish upon increasing the tract length. Thus, 
a very long A n T n tract would eventually adopt the poly-A 
conformation inside its A n blocks, but our data suggest that 
n would have to be much higher than 5. 

Apart from special types of deformation, it is desirable 
to capture the overall stiffness by one simple parameter. 
The conformational volume has traditionally been used for 
this purpose (19-21,51,74). For a larger number of coordi- 
nates, however, it starts to be very large and not practical 
to use. Instead, we propose to measure the overall stiffness 
by the conformational entropy per rigid base coordinate, 
as defined by Equation (5). Besides its tractable values, this 
quantity also has the advantage of being intensive, i.e. it en- 
ables one to compare oligomers of different lengths. Fig- 
ure 4 shows entropy per coordinate for the A-tracts and for 
the corresponding parts of the control sequence. Entropy 
of the asymmetric A2 n tracts is equal or only slightly lower 
than the control. This indicates the existence of deforma- 
tion modes, others than those in Figure 3, in which A2 n is 
actually more flexible than the control. These flexible modes 
manifest themselves in global stiffness properties described 
below. In contrast, the symmetric A n T n tracts are more rigid 
(have lower entropy) than the control and their A2 n coun- 
terparts. The entropy per coordinate decreases with A-tract 
length, meaning that prolonging A-tracts results in their 
overall rigidification. 

Local structure and backbone dynamics 

Equilibrium values of the local coordinates (Supplemen- 
tary Figure S3) and minor groove widths (Supplementary 
Figure S4) confirm known structural characteristics of A- 



Ts c [kcalmoM] 




CG6A6A3T3 CG8A8A4T4 CGIOA10A5T5 



Figure 4. Entropy per rigid base coordinate s c , multiplied by T = 300 K. 
The symmetric A2 n tracts (red) have slightly lower or equal entropy com- 
pared to the control sequence (blue). In contrast, the symmetric A n T n 
tracts (green) always have lower entropy (are more rigid) than the asymmet- 
ric tracts and the control. Rigidity per coordinate increases as the A-tracts 
get longer. CGm denotes the m central pairs of the control sequence. 

tracts. Propeller of A n T n clearly reaches a plateau, whereas 
roll does not, probably reflecting once again the non-local 
effect of the central AT step. The BII populations (Supple- 
mentary Figure S5) are around 10% for the adenine strand 
and essentially zero for the thymine strand. The secondary 
peak at the second AA step from the 5' end is consistent 
with increased flexibility at that location detected by NMR 
(18). Sugar puckers (Supplementary Figure S6) are uni- 
formly higher (more south) in the adenine strand than in 
the thymine strand. 

It is informative to compare the simulated structures with 
available crystallographic and NMR data. Figure 5A shows 
MD values for A6 together with data for the three inde- 
pendent structures from a 2.3 A crystal (bdl047 (12)) and a 
structure from NMR with residual dipolar couplings (lfzx 
(75)). Figure 5B compares MD values for A3T3 with a 2.2 
A and 1.5 A crystal data (bdl038 (76) and bd0067 (36), re- 
spectively). The minor groove widths are in Figure 6. The 
B-DNA values are also shown in the figures (propeller is 
taken from (58), roll, twist and slide from (44), minor groove 
width from canonical B-DNA). All the experimental data 
show high negative propeller and narrow minor groove, in 
line with general structural features of A-tracts. However, 
they are highly variable in their numerical values. For in- 
stance, propeller of a given pair may span the range of 13° 
among different structures, the range of roll may attain 6°. 
Moreover, the crystallographic roll in Figure 5B is not sym- 
metric, although the crystallized sequence is palindromic. 
Thus, experimental structures of A-tracts may not be as in- 
variable as traditionally thought. This especially concerns 
propeller, which (together with buckle) has been identified 
as the most flexible structural parameter (42). MD values lie 
mostly within the experimental range, although the minor 
groove narrowing seems to be underestimated and the MD 
slide in A3T3 is too negative. 

Global stiffness and bending 

Besides the rigid base description, we also model our DNA 
oligomers at a longer scale as anisotropically bendable and 
twistable elastic rods (Figure 1). Their configuration is de- 
scribed by three global coordinates: bending angle towards 
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Figure 5. Comparison of equilibrium values of propeller, roll, twist and slide from MD simulations (red) with crystal structures (light and dark blue) 
and an NMR structure (black). The bdl047 crystal contains three independent structures, which are shown here individually. Generic B-DNA values are 
represented by horizontal grey lines. All the structures qualitatively capture A-tract properties, notably the high negative propeller. However, there are 
substantial quantitative differences between the individual experimental structures of A-tracts. 
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Figure 6. Minor groove profiles from MD simulations and from experi- 
mental structures, colour coding as in Figure 5. 



the grooves in the oligomer centre (global roll, p), bending 
angle in the perpendicular direction, i.e. towards the back- 
bone (global tilt, r) and total twist, co. The model is again 
parametrized from structural fluctuations in atomistic MD 
simulations, as described in Materials and Methods. It is 
characterized by equilibrium values of the three global co- 
ordinates and by the 3 x 3 stiffness matrix. 

We first focus on the A-tracts as such (without flanking 
sequences). To start with, we consider a simplified case of 
pure bending, while total twist is free to relax. This yields an 
effective 2x2 stiffness matrix with respect to global roll and 
tilt. The diagonal entries are stiffness constants for bending 
to the grooves (a g ) and to the backbone (ab), respectively. 
The off-diagonal terms are found to be at least an order 
of magnitude smaller than the diagonal entries. Thus, the 
global bending energy is to a high precision just a sum of 
two quadratic terms, expressing contributions from bending 
to the grooves and to the backbone. It follows from general 
properties of quadratic forms (see for instance (77)) that the 



stiffness constant for bending in any other direction lies be- 
tween a g and a^. 

The two bending stiffness constants are shown in Table 
2, together with the effective isotropic constant ai so given 
by the relation 2/a iso = 1 /a g + 1 /ab (78). The a iso values in 
Table 2 clearly show that, in terms of global bending, the 
A2n tract is always more flexible than the C/G-rich control 
sequence. In contrast, the A n T n tract is always stiffer than 
the control (and stiffer than A2 n ). The values of a iso for the 
control (72-75 nm) are higher than the consensus persis- 
tence length of generic DNA (50 nm) which, however, is a 
property of a very long stretch of random sequence and also 
contains a contribution from static conformational disor- 
der. 

Another simple deformation we consider is pure twist- 
ing, while bending is unconstrained. The twist stiffness con- 
stants c are in Table 2. Here again we see that A n T n is 
stiffer and A2 n more flexible than the control. The twist stiff- 
ness constants are comparable to values of ~100 nm ob- 
tained from single molecule manipulation experiments (79- 
82). Supplementary Table SI lists the complete 3x3 stiff- 
ness matrices which enable one to compute the deformation 
energy for simultaneous bending and twisting according to 
Equation (11). 

When inserted between the flanking sequences, our A- 
tracts induce bending to the DNA helix which decreases 
with A-tract length and is about the same in magnitude 
for A n T n and A 2n tracts (Table 1). The length dependence 
agrees with gel mobility data showing that A n tract bend- 
ing is maximal for n = 6 and then decreases with increasing 
n (83). The bending is towards the minor groove in the A- 
tract centre. The only outlier in these trends is Aio, gently 
bent to the backbone. We were unable to find any simple 
structural reason for this behaviour. The bending magni- 
tude (12° for A 6 and A 3 T 3 ) is lower than the 17°-21° ob- 
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Table 2. Global bending and twist stiffness for the A-tracts alone (without flanking sequences) and for the corresponding parts of the control. The average 
frames used to compute the properties are defined by bases in two consecutive pairs at the beginning, in the middle and at the end of each indicated tract 
(e.g. A A A A ATTTTT) . The errors are computed as in Table 1. 



Sequence 


Groove bending stiffness, 
a g (nm) 


Backbone bending 
stiffness, ai, (nm) 


Effective isotropic bending 
stiffness, a IS „ (nm) 


Twisting stiffness, C (nm) 


A3T3 


65(2) 


137(1) 


88(2) 


118(1) 


A 6 


49(0) 


118(1) 


69(0) 


102(1) 


GC6 a 


56 (2) 


112(2) 


75(1) 


117(2) 


A4T4 


91(1) 


94(3) 


92(1) 


135(1) 


A 8 


60(2) 


80 (0) 


68(1) 


101(1) 


GC8 a 


68(2) 


77(1) 


72(1) 


106(1) 


A5T5 


114(7) 


75 (3) 


90(4) 


150 (5) 


A10 


83 (3) 


61(3) 


70(1) 


113(1) 


GC10 a 


84(2) 


67(1) 


74(0) 


115(1) 



GCra denotes m central base pairs of the control sequence 



tained from cyclization experiments (84). However, those 
experiments were done at 10 mM MgCl2 salt concentration 
(84). There are indications that A 6 tracts may induce only 
a 7° bend in the absence of Mg 2+ (85), and a recent study 
provides evidence that A-tracts produce essentially no cur- 
vature in near-physiological concentration of monovalent 
ions (86). 

Finally, we investigate the global stiffness of A-tracts to- 
gether with their flanking sequences (12 bp in total, Table 
1). This property is just as relevant as the stiffness of the A- 
tracts themselves, since in some constructs (e.g. in (27)), a se- 
ries of phased shorter A-tracts interspersed by non-A-tract 
sequences is used instead of long A-tracts. Analysis exactly 
analogous to the above yields bending stiffness of sequences 
with embedded A n very close to the control, in agreement 
with experiment (27). In contrast, sequences with embed- 
ded A n T n are stiffer in bending. Similar results are found 
also for twisting stiffness (Table 1). 

We checked the dependence of the results on the choice of 
the global coordinates. Replacing the average frames (Fig- 
ure 1) by the base pair step middle frames from 3DNA 
yields results identical within error estimates. If we choose 
the step middle frames, local twist and local rise from 
Curves+ (87) instead, we obtain nearly identical shape and 
bending stiffness, while twist stiffness is uniformly about 
10% higher than in Table 2. Thus, the tested coordinate 
choices have negligible effect on the global shape and rel- 
ative stiffness of our sequences. 

DNA looping and nucleosome positioning 

A number of studies have reported that A n tracts are rel- 
atively depleted of nucleosomes in vivo, mostly due to the 
tracts' intrinsically lower nucleosome affinity (see the re- 
views (7,8) and references therein). As an explanation, it has 
been suggested that A n tracts resist the structural deforma- 
tions required for nucleosome formation (7). A very recent 
study on Schizosaccharomyces pombe (88) offers a fresh per- 
spective on the issue. It reports that A n tracts affect but do 
not deplete nucleosomes in S. pombe and that they prefer 
particular rotational positions. As a result, the A-tract frac- 
tion in the nucleosome changes in a periodic manner with 
the distance from the dyad. Thus, although the A n tracts 
may perturb nucleosome formation, the effect is likely to be 
more subtle and depend on the position of the tract within 



the nucleosome. This view is supported by a structural study 
of nucleosome core particles containing an Ai6 tract (89). 
The A-tract fragment conforms well to the topology typi- 
cal for nucleosomal DNA, but the DNA structure is locally 
distorted and the interactions between the DNA ends and 
the histone core are destabilized. 

To obtain insight into the A-tract behaviour in the nu- 
cleosome, we performed threading of the tracts through a 
high-resolution nucleosome crystal structure (lkx5 (90)). In 
threading, the tested DNA fragment is constrained to the 
DNA geometry at a particular location in the nucleosome 
and the energy needed to deform the fragment is recorded. 
The fragment is then shifted by 1 bp along the nucleosomal 
DNA and the calculation is repeated. As a result, one ob- 
tains the deformation energy as a function of the fragment 
position within the nucleosomal DNA. 

In our case, only roll, twist and slide were constrained 
to the nucleosomal x-ray values, all the other coordinates 
were left free to relax. This choice is based on the finding 
that roll, twist and slide are highly conserved among nucle- 
osome structures (91). Thus, we used the partially relaxed 
model (Equations (7) and (8)) to compute the deformation 
energy, with the coordinate vector containing roll, twist 
and slide of all the steps. Figure 7 shows deformation energy 
profiles for threading the A10 and A5T5 tracts compared to 
GC10, the central 10 bp of the control sequence. Results 
for the other tracts are analogous. The average deformation 
energy of A10 is 4 kcal/mol higher than average of the con- 
trol, and it rises by another 7 kcal/mol when passing from 
A10 to A5T5. These values should be understood as upper 
bounds, since they do not include full structural adaptation 
of DNA to nucleosome binding. Although the deformation 
energy of A10 is higher on average than the control, there 
are locations where the two energies are close to each other. 
In contrast, the A 5 T 5 tract is more resistant to deformation 
(have higher deformation energy) at nearly every position. 

The deformation energy in Figure 7 follows roughly a 10- 
bp periodicity pattern, as recently reported (88). A closer in- 
spection reveals this pattern to correlate with static bending 
of the A-tracts alone (without flanking sequences), shown in 
Supplementary Table S2. Note that this is different from the 
A- tract-induced bending (here reported in Table 1) where 
A-tracts and the flanking non-A-tract sequences are exam- 
ined together. All our A-tracts are slightly bent (by 3°-5°) 
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Figure 7. Deformation energy associated with threading of Aio and A5T5 
tracts and the central 10 bp of the control sequence (CG10) through the 
lkx5 nucleosome structure. Roll, twist and slide are taken from the nucle- 
osomal DNA, all the other coordinates are left unconstrained. The sym- 
metric A5T5 tract (green) resists deformation more than the asymmetric 
Aio tract (red), whereas the control sequence (blue) is incorporated more 
easily on average than the two tracts. However, the energy roughly follows 
a 10-bp periodicity pattern and there are positions where the Aio and con- 
trol values are close to each other, whereas the A5T5 values are higher at 
the majority of locations. 



towards the major groove in their centre, thus the energy 
is lower when the major groove faces the nucleosome core. 
The A-tract bending direction in experimental structures is 
inconclusive: the tracts are bent by several degrees to the mi- 
nor groove (bdl047), major groove (bdl038) or the backbone 
(bd0067 and lfzx) (Supplementary Table S2). Threading of 
A-tracts embedded in their flanking sequences yields more 
noisy profiles, but with nearly identical phasing as for the 
tracts themselves (Supplementary Figure S7). 

It has been pointed out that DNA sequences behave quite 
differently with respect to nucleosome positioning and with 
respect to looping (92). In particular, the asymmetric A n 
tracts, which may resist nucleosome formation due to their 
rigidity, are found to be highly flexible in the context of 
DNA looping (29). We hypothesize that this strikingly dif- 
ferent behaviour may be due to a different type of deforma- 
tion taking place in the two cases. On the one hand, DNA 
structure in the nucleosome, although flexible to some ex- 
tent (93), is under local constraint and may be deformed 
also with respect to translational degrees of freedom such 
as slide (94). The local displacement of the DNA bases then 
plays a major role. On the other hand, DNA looping only 
imposes boundary conditions at the ends and thus probes 
long length-scale deformation modes. A rather loose defini- 
tion of the looping boundary conditions, which also involve 
the protein flexibility (92), adds to the freedom of choosing 
the most favourable mode. 

Our data on the asymmetric A n tracts support this hy- 
pothesis. When rigid base coordinates are constrained in the 
deformation, then A n tracts appear more rigid than the con- 
trol G/C-rich sequence. We demonstrated this for special 
types of deformation involving individual coordinates, and 
by threading the A n tract through the nucleosome. In global 
bending and twisting, however, A n tracts are more flexible 
than the control. Note that, just as the rigid base coordi- 
nates, the global bending and twisting coordinates again de- 



pend only on the relative position and orientation of indi- 
vidual bases. Indeed, the global roll and tilt depend on the 
end and middle frames which, in turn, are means of selected 
base-fixed frames, and the total twist is just the sum of local 
twist values. Thus, both local and global deformations sam- 
ple the same underlying rigid base energy surface. The dif- 
ference, however, is that global deformations impose much 
less constraint on the rigid base displacements and thus en- 
able the system to use a more flexible deformation mode. 

Our results for symmetric A n T n tracts differ sharply from 
those for the asymmetric A2 n tracts. We find that A n T n 
tracts are stiffer than A2 n and the control both at the lo- 
cal and at the global, elastic rod levels. This indicates that 
A n T n tracts should not loop so easily, and should resist nu- 
cleosome formation even more than A2 n tracts of the same 
length do. 

Recently, artificial insertion of A n tracts of different 
length and purity into the yeast genome has been used to 
manipulate nucleosome positioning and thus fine-tune reg- 
ulation of gene expression (9,10). Our results suggest that 
by using symmetric A n T n tracts instead of asymmetric A2 n 
tracts of the same length, one should be able to obtain an 
even stronger nucleosome exclusion effect. Equivalently, the 
same exclusion effect could be obtained with shorter tracts, 
perturbing less the original genome sequence. 

CONCLUSION 

A-tracts are unique structural elements within double- 
stranded DNA which play a major role in nucleosome 
positioning and transcription factor binding. Their struc- 
tural features, such as high negative propeller, narrow mi- 
nor groove and bending induced to DNA, have been well 
characterized. In contrast, information about their mechan- 
ical properties has been contradictory. A-tracts appear rigid 
in some circumstances, such as nucleosome formation, but 
seem flexible in others, notably in DNA looping. To ex- 
plain these findings, more detailed information about A- 
tract stiffness is needed but is difficult to obtain experimen- 
tally. 

In this work we investigate mechanical properties of sym- 
metric (A n T n ) and asymmetric (A2 n ) tracts using a model of 
rigid bases interacting via a non-local harmonic potential, 
and another, more global model representing DNA as an 
anisotropically bendable and twistable elastic rod. We have 
found that the A-tract stiffness relative to a control G/C- 
rich sequence depends on the type of deformation they un- 
dergo. Asymmetric tracts are stiffer than the control with 
respect to localized deformations, but more flexible when 
global bending and twisting takes place. The localized con- 
straints play a role in nucleosome positioning, while global 
boundary conditions are imposed in looping. Both local 
and global deformations sample the same underlying base- 
base deformation energy, but global boundary conditions 
impose less constraint to base displacements and thus al- 
low the system to use a more flexible deformation mode. 
Our results, therefore, can reconcile the seemingly contra- 
dictory stiffness behaviour of the asymmetric A-tracts. 

The symmetric A-tracts are found to be stiffer than the 
asymmetric ones and stiffer than the control, both with re- 
spect to local and global deformation. Thus, our results pre- 
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diet that the symmetric A n T n tracts should affect nucleo- 
some formation even more than the asymmetric A2 n tracts 
do. It has already been found that the degree of nucleosome 
eviction depends on the length of A n tracts and the number 
of mutations interfering with the A-tract structure, which 
enables one to fine-tune gene expression by incorporating 
A n tracts into the genome. Our results open yet another pos- 
sibility, namely, using the symmetric A n T n tracts instead, 
which are expected to be more efficient in nucleosome ex- 
clusion. 

In summary, our work establishes detailed stiffness 
properties of the symmetric A n T n and asymmetric A2 n 
DNA A-tracts using coarse-grained models consistently 
parametrized from large-scale, explicit solvent MD simu- 
lations. The results can reconcile the seemingly contradic- 
tory stiffness properties of A-tracts, expose the differences 
in A n T n and A2 n tract mechanics, and have implications for 
gene expression manipulation using A-tracts. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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